Acoustic Model Adaptation for Recognition of Dysarthric Speech

Authors

  • Harsh Vardhan Sharma
  • Kumkum Sharma
  • Krishna Kant Sharma
Abstract

Speech production errors characteristic of dysarthria are chiefly responsible for the low accuracy of automatic speech recognition (ASR) when used by people diagnosed with it. The few speech recognition studies on this population, mostly conducted by assistive technology researchers, bear this out. In the engineering community, substantial research has been conducted on algorithms that adapt models of speech acoustics trained on one dataset for use with another; these algorithms are mostly mathematically motivated. A person with dysarthria produces speech in a rather reduced acoustic working space, causing typical measures of speech acoustics to take values in ranges very different from those characterizing unimpaired speech. It is therefore unlikely that models trained on unimpaired speech will be able to adjust to this mismatch when acted on by one of these adaptation algorithms. Creating acoustic models trained exclusively on pathological speech is also difficult: members of this population find it tiring to pursue physical activities, including speech production, for sustained periods of time. While this makes speaker adaptation an approach worthy of pursuit, almost no research has been conducted so far on acoustic model adaptation methods for recognition of dysarthric speech. This dissertation presents such a study. First, it investigates the efficacy of a popular adaptation algorithm for dysarthric speech recognition. It then proposes an additional step in the adaptation process that separately models ‘normal’ and pathology-induced variations in speech characteristics, and does so by trying to account for a view of the acoustics of motor speech disorders recently proposed in the clinical research community. Results show that explicitly addressing the population mismatch helps to increase recognition accuracy.

Similar resources

Acoustic model adaptation using in-domain background models for dysarthric speech recognition

Speech production errors characteristic of dysarthria are chiefly responsible for the low accuracy of automatic speech recognition (ASR) when used by people diagnosed with it. A person with dysarthria produces speech in a rather reduced acoustic working space, causing typical measures of speech acoustics to have values in ranges very different from those characterizing unimpaired speech. It is ...

Maximum Likelihood Linear Regression (MLLR) for ASR Severity Based Adaptation to Help Dysarthric Speakers

Automatic speech recognition (ASR) for dysarthric speakers is one of the most challenging research areas. The lack of corpora for dysarthric speakers makes it even more difficult. Speaker adaptation (SA) is an alternative solution to overcome the lack of dysarthric speech and enhance the performance of ASR. This paper introduces severity-based adaptation, using a small amount of speech dat...

Universal Access: Experiments in Automatic Recognition of Dysarthric Speech

This thesis describes the results of first experiments in small and medium vocabulary dysarthric speech recognition, using the database being recorded by the Statistical Speech Technology Group under the Universal Access initiative. Speaker-dependent, word- and phone-level speech recognizers utilizing the hidden Markov model architecture were developed and tested; the models were trained exclusiv...

Correction: Severity-Based Adaptation with Limited Data for ASR to Aid Dysarthric Speakers

Automatic speech recognition (ASR) is currently used in many assistive technologies, such as helping individuals with speech impairment in their communication ability. One challenge in ASR for speech-impaired individuals is the difficulty in obtaining a good speech database of impaired speakers for building an effective speech acoustic model. Because there are very few existing databases of imp...

Dysarthric speech recognition using dysarthria-severity-dependent and speaker-adaptive models

Dysarthria is a motor speech disorder that impairs the physical production of speech. Modern automatic speech recognition for normal speech is ineffective for dysarthric speech due to the large mismatch of acoustic characteristics. In this paper, a new speaker adaptation scheme is proposed to reduce the mismatch. First, a speaker with dysarthria is classified into one of the pre-defined severit...

Publication date: 2012